STIC Biotechnology Systems Branch 



CRF Problem Report 

The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) experienced a problem when processing the following computer 
readable form (CRF): 



STIC Contact: Mark Spencer: Telephone: 571-272-2510; Fax: 571-273-0221 
Nature of CRF Problem: 

(circle one) Damaged or Unreadable (for Unreadable, see attached) 

Blank (no files on CRF) (see attached) 

Empty file (filename present, but no bytes in file) (see attached) 

Wrong file saved to CRF (invention title, docket number, or applicant(s) do not 

match those in official application) (see attached) 
Not saved in ASCII text 

Sequence Listing was embedded in the file. According to Sequence Rules, 

submitted file should only be the Sequence Listing. 
Did not contain a Sequence Listing, (see attached sample) * 



PLEASE USE THE CHECKER VERSION 4.4.0 PROGRAM TO REDUCE ERRORS. 
SEE BELOW FOR ADDRESS: 

hnp:/w^w. uspto.gov/web/ofrices/pac/checker/chkrnote.htm 

1. EFS-Bio (<http://www.uspto,gov/ebc/efs/downloads/documents.htm> . EFS Submission 

User Manual - ePAVE) 
Z U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 
3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 

U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 

Alexandria, VA 22314 

Revised 01/20/06 



Application Serial Number: 
Filing Date: 

Date Processed by STIC: 




BEST AVAILABLE COPY 



Confidential 8/22/2003 Xin-Hua Feng and Yingxian Xiao 



SEQUENCB LISTING 

(1) GENERAL INFORMATION: 
(iii) NUMBER OF SEQUENCES : 9 

(2) INFORMATION FOR SEQ ID NO 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 



(D) TOPOLOGY: linear «r<T7 9 s,\ ,^Jj S OI 

(ii) MOLECULAR TYPE: DNA f njL£> £° <& tn ^ /C *^^ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 J / ' — 

AACggatccaaaacgctgcctccgcga 27 S<^QjUuC4^C€^ /v>A*WP 

(2) INFORMATION FOR SEQ ID NO: 2: Y v / 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 25 

(B) TYPE: nucleic 

(C) STRANDEDNESS 



ARACT ERISTICS : fj g <f \ g a / / 



J j9 

(C) STRANDEDNESS: single i & ^ <~ s % I fi * t 

(D) TOPOLOGY: linear / )f rkJjJU CCO*'*' &>~> A* 
(ii) MOLECULAR TYPE: DNA L/-> ^/V O ~ «» 
(iv) ANTI-SENSE: YES 'l— I ^ gj^r^ CX D^O*- 

NCE DESCRIPTION: SEQ ID NO: 2 CjOc*VT7~*& ' (^f CZ^T' *S ^ 



(xi) SEQUENCE 

TAGACGCTGCAGGAGGCGCCTGGCT 25 



(A) LENGraT"2r9^se"pairs^ " , ] lj 1*^ ID***-*^ ^ >C -\ 

(B) TYPE: nucleic acid ^W^JT * <l #./ffv> 

(C) STRANDEDNESS: single ^ 0 9^1 up. 1 ^Ve^Q /AiU^CC A.CtA-W 

(D) TOPOLOGY: linear -£^7 /XJC^/ *Z « 0^^ M .t ~ 
(ii) MOLECULAR TYPE: DNA JL s>>^ I ^^TtkjL^ 3 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 ^V777 W • ' f ^ 

ggatccaaaacgctgcctccgcgacagggcggaggacggagggcgtcccaggatcgtggg 60 / *v ^ 
ccctgggcctgacgcctcggagcactccctgctccgagcgggcccgatgtggtggaagct 120 (^L£ lx V^-y -*C7 
cgggagcgcgggagccgggggaaggccgcgggcagccgtcgggggtccccgatccgagcc 180 ^/ 
ccgcggccccgggctggcggtgtcggctgcaatccggcgggcacggccggccgggctggg 240 ( U \ Q (L*s%j£&- 
ctcttggggcagccaggcgcctccttcag 269 f\TOJ ^<Z^£ AlM * / ^^ 

(2) INFORMATION FOR SEQ ID NO: 4: 

(1) SEQUENCE CHARACTERISTICS: JSjLAJiJL^ X "° 

(A) LENGTH: 84 base pairs " / 

(B) TYPE: nucleic acid ^ Jcs>*^+ 

(C) STRANDEDNESS: single /A cl^/ ^ 

(D) TOPOLOGY: linear ^ ✓ . ^ * 
(ii) MOLECULAR TYPE: DNA C ^ ^T>LaJBJL^C-<- (^^<^J 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 *^ V 
AGAAGACGAagctaagcagggtcgggcctggttagtacttggatgggagaccgcctggga 60 ^ ^ A fi <*< 
ataccgggtgctgtaggctttttg 84 C%ju%k cfh*-£& Jr^~ 

(2) INFORMATION FOR SEQ ID NO: 5: At fit ~t 

(1) SEQUENCE CHARACTERISTICS: 4 A ff ^1 ^ /Yl W\tvX' * 

(A) LENGTH: 88 base pairs \S&JL* K±{ TOI/^)V^ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULAR TYPE: DNA 
(iv) ANTI- SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

TCGAOUUUIAGCCTAC^GCACCCGGTATTCC^ 60 
CKSCCCX^CCCTGCTTAGCTTCGTCTTCT 88 

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 



1 



<110> 

<120> 

<130> 

<140> 
<141> 

<150> 
<151> 

<160> 

<170> 



<210> 

<2n> 

<212> 
<213> 



<220> 
<221> 
<222> 

<300> 
<301> 
<302> 

<303> 
<304> 
<305> 
<306> 
<307> 
<308> 
<309> 



<400> 

agctgtagtc 
agggagagtg 
tgatgtggca 
cgcggcgcgg 



Smith, John; Smtthgene Inc. 
Example of a Sequence Listing 
01-00001 



PCT/EP98/00OO1 
1998-12-31 



US 08/999,999 
1991-10-15 



Patent rn version 2.0 



1 

389 
ONA 

Paramecium sp. 



CDS 

(279) . . . (389) 



Doe, Richard 

Isolation and Characterization of a Gene Encoding a 

Protease from Paramecium sp. 

Journal of Genes 

1 

4 

1-7 

1988-06-31 

123456 

1988-06-31 



attcctgtgt cctcctctct ctgggcttct caccctgcta atcagatccc 

tcttgaccct cctctgcctt tgeagcttea caggcaggea ggcaggcagc 

attgetggea gtgecacagg cttttcagcc aggcttaggg tgggttccgc 

cggcccctct cgcgctcctc tcgcgcctct ctctcgctct cctctcgctc 



60 
120 
180 
240 



ggacctgatt aggtgagcag gaggaggggg cagttagc atg gtt tea atg etc age 296 

Met Val Ser Met Phe Ser 
1 5 

ttg tct ttc aaa tgg cct gga ttt tgt ttg ttt gtt tgt ttg ttc eaa 344 
Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val Cys Leu Phe Gin 
10 15 20 

tgt ccc aaa gtc etc ccc tgt " cac tea tea ctg cag ccg aat ctt 389 
Cys Pro Lys Val Leu Pro Cys His Ser Ser Leu Gin Pro Asn Leu * 
15 30 35 

<210> 2 

<211> 37 

<212> PRT 

<213> Paramecium sp. 

<400> 2 

Met Val Ser Met Phe Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu 
15 10 15 

Phe Val Cys Leu Phe Gin Cys Pro Lys val Leu Pro Cys His Ser Ser 
20 25 30 

Leu Cln Pro Asn Leu 
35 

-<210> 3 
<211> 11 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Designed peptide based on size and polarity to act as a 

linker between the alpha and beta chains of Protein XY2 . 

<400> 3 

Met Val Asn Leu Glu Pro Met His Thr Glu He 
I 5 10 



<210> 
<400> 
000 



4 
4 



*Requiremeau for Applications - OG Date: 23 Jane 1998 



http-7/vAvw.u spto . go v/urb/o fHccs/co m/sol/og/ ! 998Avedc2 Vpatrcqriitm 



cutu Liieii <tcourapanyxn9 xmannd.uun as snown xu cue lonowiay 
table. The numeric identifier shall be used only in the "Sequence 
Listing." The order and presentation of the items of information in the 
"Sequence Listing" shall conform to the arrangement given below. Bach 
item of information shall begin on a new line and shall begin with the 
numeric identifier enclosed in angle brackets as shown. The submission 
of those items of information designated with an "M" is mandatory. The 
submission of those items of information designated with an "O" is 
optional. Numeric identifiers <110> through <170> shall only be set 
forth at the beginning of the "Sequence Listing." The following table 
illustrates the numeric identifiers. 



Numeric 
Identifier 



Definition 



Comments and 
Format 



Mandatory (M) 
Optional JO) 



or 



<110> 



Applicant 



Preferably max. 
of 10 names; 
one name per~ line; 
preferable format: 
Surname, Other 
Names and/or 
Initials 



<120> 
<130> 



Title of 
Invention 

File Reference 



Personal' file 
reference 



M 



M when filed prior 
to assignment of 
appl . number 



<140> 



Current Applica- 
tion Number 



Specify as : 

US 07/999,999 or 

PCT/US96/99999 



M, if available 



<141> 



Current Filing 
Date 



Specify as:- yyyy-mm-dd M, if available 



<1S0> 



Prior Application Specify as: 
Number US 07/999,999 or 

PCT/US96/99999 



M, if applicable 
include priority 
documents under 
35 USC 119 and 
120 



<151> 
<160> 



Prior Application Specify as: 
Filing Date 



yyyy-mm-dd M, if applicable 



Number of SEQ ID 
NOs 



Count includes 
total number of 
SEQ ID NOs' 



M 



<170> 



Software 



Name of software used 
to create the 
Sequence Listing 



<210> 



SEQ ID NO: # : 



Response shall be an 
integer repre- 
senting the SEQ 
ID NO shown 



M 



<211> 



Length 



Respond with an integer M 
expressing the number 
of bases or amino acid 
residues 



of 34 
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. Requirements for AppKcatioas -OG Date 23 June 1998 
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<212> Type 



<213> Organism 



<220> Feature 



Whether presented 
sequence mole- 
cule is DNA, 
RNA, or PRT 
(protein)* If 
a nucleotide 
sequence con- 
tains both DNA 
and RNA frag- 
ments , the 
type shall be 
-DNA. tt In ad- 
dition, the 
combined DNA/ 
UNA molecule 
shall be further 
described in 
the <220> to 
<223> feature 
section* 

Scientific name, 
i.e. Genus/species , 
Unknown or Artifi- 
cial Sequence.. In 
addition, the 
"Unknown" or 
"Artificial Se- 
quence" organisms 
shall be further 
described in the 
<220> to <223> 
feature section. 

Leave blank~after 
<220>. <221-223> 
provide for a 
description of 
points of bio- 
logical signi- 
ficance in the 
sequence . 



<221> 



Name/ Key 



Provide appropriate 
identifier for 
feature, pre- 
ferably from 
wipo Standard 
ST. 25 (1998), 
Appendix 2, 
Tables S and 6 



<222> 



Location 



Specify location 
within sequence; 
where appropriate 
state number of 
first and last 
bases/amino acids 



29 of 34 



M 



M 



M, under the 
following condi- 
tions: if "n, " 
"Xaa, " or a mod- 
ified or unusual 
L-araino acid or 
modified base was 
used in a se- 
quence; if ORGAN- 
ISM is "Artifi- 
cial Sequence" or 
"Unknown"; if 
molecule is 
combined DNA/RNA. 

M, under the fol- 
lowing* conditions: 
if "n, " "Xaa # " or 
a modified or un- 
usual L-amino _ 
acid or modified 
base was used in 
a sequence 

M, under the fol- 
lowing conditions: 
if "n, " "Xaa," or 
a modified or un- 
usual L-amino 
acid or modified 



1/29/99 !:53 PM 



• x-i-s* tAIK. «V 1770 



nn|t/i w^-wjjspio^ov/wctyouices/oom/sot/og/ 1 y*/8/wee*2 S/patre^htra 



<223> 



Other Infor- 
mation 



<300> 
<301> 



Publication 
Information 

Authors 



<302> 
<303> 
<304> 
<305> 
<306> 
<307> 



<308> 



<309> 



<310> 



Title 

Journal 

Volume 

Issue 

Pages 

Date 



Database 

Accession 

Number 



Database Entry 
Date 



Patent Document 
Number . 



Other relevant 

information; 

four' lines maximum 



in feature base was used in 

a sequence 

M, under the fol- 
lowing conditions: 
if *n," "Xaa," or 
a modified or un- 
usual L-amino acid 
or modified base 
was used in a 
sequence; if 
ORGANISM 
is "Artificial 
Sequence" or 
"Unknown"; if 
molecule is com- 
bined DNA/RNA. 

Leave blank O 
after <300> 

. Preferably max O 
of ten named 
authors of publi- 
cation; specify, 
one name per line; 
preferable format: 
Surname, Other 
Names and/ or 
Initials 

° 
O 

o 

. o 
o 

Journal date on which O 
data published; 
specify as yyyy-mm- 
dd, MMM-yyyy or 
Season-yyyy 

Accession number o 
assigned by data- 
base including 
database name 

Date of entry in o 
database; specify 
as yyyy-mm-dd or 
MMM-yyyy 

Document number; o 
for patent-type 
citations only. 
Specify as, for 
example, US 

07/999,999 . 
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<311> 
<312> 

<313> 
<400> 



Patent Filing 
Date 



Publication Oate 



Relevant 
Residues 

Sequence 



Document filing 
date, for patent- 
type citations only; 
specify as yyyy-mm-dd 

Document publication 
date, for 
patent-type 
citations only; 
specify as yyyy-mm-dd 

FROM (position) TO 
(position) 

SEQ ID NO should 
follow the 
numeric identifier 
and should appear 
on the line pre- 
ceding the actual 
sequence 



M 



S. Section 1.824 is revised to read as follows: 

1.624 Form and format for nucleotide and/or amino acid sequence 
submissions in computer readable form. 

(a) The computer readable form required by 1.821(e) shall meet the 
following specifications: 

(1) The computer readable form shall contain a single "Sequence Listing*' 
as either a diskette, series of diskettes, or other permissible media * 
outlined in paragraph (c) of this, section. 

(2) The "Sequence Listing" in paragraph (a) (1) of this section shall be 
submitted in American Standard Code for Information Interchange (ASCII) 
text. No other formats shall be allowed. 

(3) The computer readable form may be created by any means, such as word 
processors, nucleotide /ami no acid sequence editors or other custom . 
computer programs; however, it shall conform to all specifications 
detailed in this section. 

(4) File compression is acceptable when using diskette media, so long as 
the compressed file is in a sel f -extracting format that will decompress 
on one of the systems described in paragraph (b) of this section. 

(5) Page numbering shall not appear within the computer readable form 
version of the "Sequence Listing" file. 

(6) All computer readable forms shall have a label permanently affixed 
thereto on which has been hand-printed or typed: the name of the 
applicant, the title of the invention, the date on which the data were 
recorded on the computer readable form, the operating system used, a ~ 
reference number, and an application serial number and filing date, if 
known. 

(b) Computer readable form submissions must meet these format 
requirements: . 

(1) Computer: IBM PC/XT/AT, or compatibles, or Apple Macintosh; 

(2) Operating System: MS-DOS, Unix or Macintosh; ^ 
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This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 



HI BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 



13 LINES OR MARKS ON ORIGINAL DOCUMENT 
□ REFERENCE^) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 






OTHER: 



